On Translation Corpora and Translation Support Tools: A Project Report
نویسندگان
چکیده
For more than forty years machine translation has been an active area of research and development. Several variants of MT systems have been proposed e.g., interactive systems, batch-mode systems requiring a large amount of human post-editing and systems that generate text in several languages from a common structured set of data rather than a source text. Although there are some successful systems and several products on the market, it is still the case that most translators and translation companies prefer not to use MT-systems. In fact, it has been estimated “that the current share of anything that could be called MT, pure or mixed, is well below 1% of the total translation market” (Isabelle et al, 1993).
منابع مشابه
Collecting and Using Comparable Corpora for Statistical Machine Translation
Lack of sufficient parallel data for many languages and domains is currently one of the major obstacles to further advancement of automated translation. The ACCURAT project is addressing this issue by researching methods how to improve machine translation systems by using comparable corpora. In this paper we present tools and techniques developed in the ACCURAT project that allow additional dat...
متن کاملParallel Data, Tools and Interfaces in OPUS
This paper presents the current status of OPUS, a growing language resource of parallel corpora and related tools. The focus in OPUS is to provide freely available data sets in various formats together with basic annotation to be useful for applications in computational linguistics, translation studies and cross-linguistic corpus studies. In this paper, we report about new data sets and their f...
متن کاملاستخراج پیکره موازی از اسناد قابلمقایسه برای بهبود کیفیت ترجمه در سیستمهای ترجمه ماشینی
Data used for training statistical machine translation method are usually prepared from three resources: parallel, non-parallel and comparable text corpora. Parallel corpora are an ideal resource for translation but due to lack of these kinds of texts, non-parallel and comparable corpora are used either for parallel text extraction. Most of existing methods for exploiting comparable corpora loo...
متن کاملTranslation Technology Tools and Professional Translators’ Attitudes toward Them
Today technology is an integral part of professional translation; and it is generally assumed that translators’ attitudes toward translation technology tools influence their interaction with technology (Bundgaard, 2017). Therefore, the present two-phase study seeks to shed some light on what translation technology tools are and how professional translators feel toward them. The research method ...
متن کاملA Machine Translation Toolchain for Polysynthetic Languages
We present a set of free tools for building rule-based machine translation systems for polysynthetic languages. As there are no large corpora for most of the “small” languages, it is often impossible to use statistical methods. There are some free MT tools but very little work has been done on polysynthetic languages. The aim of this project is to provide computational tools for morphological a...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1996